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(54) Abstract Title 

An apparatus and method for detecting and decompressing instructions from a variable length 
compressed instruction set 9 

l 5 ^L^w r0prOCeSSOr iS confi 9 ured to fetch a compressed instruction set which comprises a subset of a 
corresponding non-compressed instruction set. The compressed instruction set is a variable length instruction 

l«^ U *£S£ r H Xa H mP H e 16 ' bit T d 'J*- 6 " inStruCtions ' E^ed (e.g. 32-bit) instructions ar ^ coded using an 
extend opcode, which indicates that the instruction being fetched is an extended (e.g. 32 bit) instruction 
Extended instructions have a second opcode field, which is decompressed into a decompressed opcode field 
within a decompressed instruction. For non-extended instructions, the first opcode field is ^m^^Mo 
the decompressed opcode field. The compressed instruction set further includes multiple seu 
mappings from the compressed register fields to the decompressed register fields. Ce^ 
are assigned two opcode encodings, one for each of two mappings of the corresponding reg!ste S The 
compressed register -field is directly copied into a portion of the decompressed register field Si e the 
remaining portion of the decompressed register field is created using a small number of logic gatel The 
fndSST 8 h 3 h mStruCt,on K wtthin the compressed instruction set includes a compression mode whfch 
TttlTn Z ° r tar9et r ° Utine iS C ° ded in pressed instructions. The compression mode is 

stored in the program counter register. The decompression of the immediate field used for load/store 
mstrucfons havng the global pointer register as a base register is optimized for mixed 
ZTn^Zl J n H on - com P ress f ec | instruction execution. The immediate field is decompressed into a 
decompressed .mmediate field for which the most significant bit is set. 
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TITLE: AN APPARATUS AND METHOD FOR DETECTING AND DECOMPRESSING 

INSTRUCTIONS FROM A VARIABLE- LENGTH COMPRESSED INSTRUCTION 
SET * 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

This invention relates to the field of microprocessors and. more particularly, to optimization of the 
instruction set of a microprocessor. 

2. Descripnon of the Relevant Art 



Microprocessor architectcres may generally be classified as either complex instruction set computing 
(CISC) architectures or reduced instruction set computing (RISC) architectures. CISC architectures specify an 
instruction set comprising high level relatively complex instructions. Often, microprocessors impleraenring 
CISC architectures decompose the complex instructions into multiple simpler operations which may be more 
readily implemented in hardware. Microcoded routines stored in an on-chip read-only memory (ROM) have 
been successfully employed for providing the decomposed operations corresponding to an instruction. More 
recently, hardware decoders which separate the complex instructions into simpler operations have been 
adopted by certain CISC microprocessor designers. The x86 microprocessor architecture is an example of a 
CISC architecture. 

Conversely, RISC architectures specify an instruction set comprising low level, relatively simple 
instructions. Typically, each instruction within the instruction set is directly implemented in hardware. 
Complexities associated with the CISC approach are removed, allowing for more advanced implementations to 
be deigned. Additionally, high frequency designs may be achieved more easily since the hardware employed 
to execute the instructions is simpler. An exemplary RISC architecture is the MIPS RISC architecture. 

.Although not necessarily a denning feature, variable-length instruction sets have often been associated 
with CISC architectures while fixed-length instruction sets have been associated with RISC architectures. 
Variable-length instruction sets use dissimilar numbers of bits to encode the various instructions within the set 
as well as to specify addressing modes for the instructions, etc. Generally speaking, variable-length instruction 
sets anempt to pack instruction information as efficiently as possible into the byte or bytes representing each 
instruction. Conversely, fixed-length instruction sets employ the same number of bits for each instruction (the 
number of bits is typically a multiple of eight such that each instruction fully occupies a fixed number of bytes). 
Typically, a small number of instruction formats comprising fixed fields of information are defined. Decoding 
each instruction is thereby simplified to routing bits corresponding to each fixed field to logic designed to 
decode that field. 

Because each instruction in a fixed-length instruction set comprises a fixed number of bytes. locating 
instructions is simplified as well. The location of numerous instructions subsequent to a particular instruction 
is implied by the location of the particular instruction (Le. as fixed offsets from the location of the particular 
instruction). Conversely, locating a second variable-length instruction requires locating the end of the first 
variable-length instruction; locating a third variable-length instruction requires locating the end of the second 
variable-length instruction, etc. Still further, variable-length instructions lack the fixed" field structure of fixed- 
length instructions. Decoding is further complicated by the lack of fixed fields. 

1 



Unfortunately, RISC architectures employing fixed-length instruction sets suffer from problems not 
generally applicable to CISC architectures employing variable -length instruction sets. Because each 
insmicnon is fixed length, certain of the simplest instructions may effectively waste memory by occupying 
bytes which do not convey information concerning the insmicnon For example, fields which art specified as 
"don't cart M fields for a particular instruction or instructions in many fixed-length instruction sets waste 
memory In contrast, variable-length instruction sets pack the instruction information into a minima! number 
ofbytes. 

Still further, since RISC architectures do not include the more complex instructions employed by 
CISC architectures, the number of instructions employed in a program coded with RISC instructions may be 
larger than the number of instructions employed in the same program coded in with CISC instructions- Eacb- 
of the more complex instructions coded in the CISC version of the program is replaced by multiple infractions 
in the RISC version of the program. Therefore, the CISC version of a program often occupies significantly less 
memory than the RISC version of the program. Correspondingly, more bandwidth between devices storing the 
program, memory, and the microprocessor is needed for the RISC version of the program than for the CISC 
version of the program. 

SUMMARY OF THE INVENTION 

The problems outlined above are in large part solved by a microprocessor in accordance with the 
present invention. The microprocessor is configured to fetch a compressed instruction set which comprises a 
subset of a corresponding non^ompresscd instruction seL The non-compressed instruction set may be a RISC 
instruction set, such that the microprocessor may enjoy the high frequency operation and simpler execution 
resources typically associated with RISC architectures. Fetching the compressed instructions from memory 
and decompressing them within the microprocessor advantageously decreases the memory bandwidth required 
to achieve a given level of performance (e.g. instructions executed per second). Still further, the amount of 
memory occupied by the compressed instructions may be comparatively less than the corresponding non- 
compressed instructions may occupy. 

The exemplary compressed instruction set described herein is a variable length instruction set. 
According to one embodiment, two distinct instruction lengths are included: 16-bit and 32-bit instructions. 
The 32-bit instructions are coded using an extend opcode, which indicates that the instruction being fetched is 
an cxiended (e.g. 32 bit) instruction. Instructions may be fetched as 16-bit quantities. When a 16-bit 
instruction having the extend opcode is fetched, the succeeding 16-bit instruction is concatenated with the 
instruction having the extend opcode to form a 3 2 -bit extended instruction. Extended instructions have 
enhanced capabilities with respect to non-extended instructions, further enhancing the flexibility and power of 
the compressed instruction set. Routines which employ the capabilities included in the extended instructions 
may thereby be coded using compressed instructions. 

The compressed instruction set further includes multiple sets of register mappings from the 
compressed register fields to the decompressed register fields. Each value coded in the compressed register 
fields decompresses to a different register within the microprocessor. In one embodiment, the compressed 
register fields compose three bits each Therefore, eight registers are accessible to a particular instruction. In 
order to offer access to additional registers for cenain select instructions, the select instructions are assigned 
two opcode encodings. One of the opcode encodings indicates a firs: mapping of register fields, while the 



second opcode encoding indicates a second mapping of register fields. Advantageously, the compressed 
register fields may include relatively few bits while select instructions for which access to additional registers is 
desired may be granted such access. Additionally, the register mappings are selected to minimize the logic 
employed to decompress register fields. In one embodiment, the compressed register field is directly copied 
5 into a portion of the decompressed register field while the remaining portion of the decompressed register field 
is created using a small number of logic gates. 

The microprocessor supports programs having routines coded in compressed instructions and other 
routines coded in non-compressed instructions. The subroutine call instruction within the compressed 
instruction set includes a compression mode which indicates whether or not the target routine is coded in 
1 0 compressed instructions. The compression mode specified by the subroutine call instruction is captured bv the 
microprocessor as the compression mode for the routine. In one embodiment, the compression mode is stored 
as one of the fetch address bits (stored in a program counter register within the microprocessor). Since the 
compression mode is pan of the fetch address and the subroutine call instruction includes storing a return 
add res s for the subroutine, the compression mode of the calling routine is automatically stored upon execution 
15 of a subroutine call instruction. When a subroutine return instruction is executed, the compression mode of the 
calling routine is thereby automatically restored. 

An additional feature of one embodiment of the microprocessor is the decompression of the immediate 
field used for load/store instructions having the global pointer register as a base register. The immediate field 
is decompressed into a decompressed immediate field for which the most significant bit is set A subrange of 
20 addresses at the lower boundary of the global variable address space is thereby allocated for global variables of 
compressed instructions. Non-compressed instructions may store global variables in the remainder of the 
global variable address space. Advantageously, global variable allocation between the compressed and non- 
compressed routines of a particular program may be relatively simple since the subranges are separate. 

Broadly speaking, the present invention contemplates an apparatus for executing instructions from a 
25 variable-length compressed instruction set, comprising an instruction decompressor. The instruction 
decompressor is coupled to receive instructions which are members of the variable-length compressed 
instruction set, wherein the instruction decompressor is configured to examine an opcode field of a particular 
instruction The instruction decompressor is configured to determine that the particular instruction is an 
extended instruction having a first fixed length if the opcode field is coded as an extend opcode. Additionally. 
30 the instruction decompressor is configured to determine that the particular instruction is a non-extended 
instrucuon if the opcode field is coded as a second opcode different than the extend opcode. 

The present invention further contemplates a method for expanding compressed instructions into 
decompressed instructions. A compressed instruction is determined to be an extended instruction having a 
first fixed length if an opcode field of the compressed instruction is an extend opcode. If the opcode field of 
the compressed instruction is a second opcode different than the extend opcode, the compressed instrucuon is a 
non-extended instruction having a second fixed length. The compressed instruction is decompressed into a 
decompressed instruction. A number of bytes included in the compressed instruction is defined by the first 
fixed length if the compressed instruction is an extended instruction. Alternauvsly. the number of bytes is 
defined by the second fixed length if the compressed instruction is a non-extended instruction. 

The present invention still further contemplates an apparatus for expanding compressed instructions 
into decompressed instructions, comprising a first determining means, a second determining means, and a 
decompressing means. The first determining means determines that a compressed instruction is an extended 
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instruction having a first fixed length if an opcode field of the compressed instruction is an extend opcode. 
Hie second deterriuning means determines that the compressed instruction is a non-extended instruction 
having a second fixed length if the opcode field of the compressed instruction is a second opcode different than 
the extend opcode. The decompressing means decompresses the compressed instruction into a decompressed 
instruction. A number of bytes included in the compressed instruction is denned by the first fixed length if the 
compressed instruction is the extended instruction. Alternatively, the number of bytes is defined by the second 
fixed length if the compressed instruction is the non-extended instruction. 

The present invention yet further contemplates a method for executing a program including a first 
routine and a second routine in a microprocessor. A subroutine call instruction is executed within the first 
routine, wherein the subroutine call instruction indicates that the second routine is to be executed via a target - 
address of the subroutine call instruction. An indication within the subroutine call instruction is examined. If 
the indication is in a first state, the second murine is determined to be coded using compressed instructions. 
The second routine is determined to be coded using non-compressed instructions if the indication is in a second 

state different than the first state. 

Furthermore, the present invention contemplates an apparatus for executing a program including a 
first routine and a second routine in a microprocessor, comprising an executing means and an examining 
means. The executing means executes a subroutine call insmicnon within the first routine. The subroutine 
call instruction indicates that the second routine is to be executed via a target address of the subroutine call 
instruction. The examining means examines an indication within the subroutine call instruction. The 
examining means determines that the second routine is coded using compressed instructions if the indication is 
in a first state. If the indication is in a second state, the examining means determines that the second routine is 
coded using non-compressed instructions. 

The present invention soil further contemplates an apparatus for fetching compressed and non- 
compressed instructions in a microprocessor, comprising a storage device and a mode detector. The stnrage 
device stores a compression enable indicator. Coupled to the storage device, the mode detector is configured to 
detect a compression mode of a target routine upon fetch of a subroutine call instruction specifying the target 
routine. The mode detector is configured to convey the compression mode to a processor core. The processor 
core is configured to fetch compressed instructions if the compression mode indicates compressed. 
Additionally" the processor core is configured to fetch non^ompressed instructions if the compression mode 
indicates non-comprcsscd. 

The present invention yet further contemplates a microprocessor comprising an instruction 
decompressor and a processor core. The instruction decompressor is coupled to receive compressed 
instructions which are members of a variable-length compressed instruction set. The instruction decompressor 
is configured to decompress each received compressed instruction into a corresponding decompressed 
instruction. Coupled to receive decompressed instructions, the processor core is configured to execute the 
decompressed instructions. 

The present invention additionally contemplates a method for executing instruction code. 
Compressed instructions are fetched, wherein the compressed instructions are members of a variable-length 
compressed instruction set The compressed instructions are decompressed in an instruction decompressor, 
thereby forming corresponding decompressed instructions. The decompressed instructions are executed in a 
processor core. 

The present invention still further contemplates an apparatus for executing instruction code. 



comprising a fetching means, a decompressing means, and an executing means. The fetching means fetches 
compressed instructions which are members of a variable-length compressed instruction set. The 
decompressing means decompresses the compressed instructions, thereby forming corresponding decompressed 
instructions. The executing means executes the decompressed instructions. 
5 Furthermore, the present invention contemplates an instruction decompressor configured to 

decompress compressed instructions. A first one of the compressed instructions is codable to access a first 
subset of registers defined for a corresponding non-compressed instruction set. Additionally, a second one of 
the compressed instructions is codable to access the first subset of registers and is further codable to access a 
second subset of registers. 

1 0 ' The present invention further contemplates a method for decompressing compressed instructions. A 

particular compressed instruction having a first register field is decompressed using a first register mapping 
from compressed register indicators to decompressed register indicators if the particular compressed 
instruction is encoded using a first opcode. Alternatively, the particular compressed instruction having the 
first register field is decompressed using a second register mapping from compressed register indicators to 

1 5 decompressed register indicators if the particular compressed instruction is encoded using a second opcode. 

The present invention still further contemplates an apparatus for decompressing compressed 
instructions comprising a decompressing means. The decompressing means is configured to decompress a 
particular compressed instruction having a first register field using a first register mapping from compressed 
register indicators to decompressed register indicators if the particular compressed instruction is encoded using 

2 0 a first opcode. Additionally, the decompressing means is configured to decompress the particular compressed 
instruction using a second register mapping from compressed register indicators to decompressed register 
indicators if the particular compressed instruction is encoded «.qng a second opcode. 

The presem invention yet further contemplates an instruction decompressor configured to decompress 
a compressed register field of a compressed instruction into a decompressed register field of a decompressed 

2 5 instruction. A decompression of the compressed register field is dependent upon a first value coded into the 

compressed register field and a second value coded into an opcode field of the compressed instruction. 

The present invention additionally contemplates a method for decompressing a compressed register 
field of a compressed instruction into a decompressed register field of a decompressed instruction. At least a 
portion of the compressed register field is directly copied into a portion of the decompressed register field. The 

3 0 remaining portion of the decompressed register field is produced by logically operating upon the compressed 

register field. 

Moreover, the present invention contemplates an apparatus for decompressing a compressed register 
field of a compressed insouction into a decompressed register field of a decompressed instruction, comprising 
a first means and a second means. The first means is for directly copying at least a portion of the compressed 

3 5 register field into a portion of the decompressed register field. The first means is coupled to receive the 

compressed register field. Similarly coupled to receive the compressed register field, the second means is for 
logically operating upon the compressed register field to produce a remaining portion of the decompressed 
register field. 

Furthermore, the present invention contemplates an instruction decompressor configured to 

4 0 decompress a compressed register field of a compressed instruction into a decompressed register field of a 

decompressed instruction. The instruction decompressor forms a first portion of the decompressed register 
field by copying at least a portion of the compressed register field thereto. Additionally, the instruction 
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decompressor includes a logic block which is configured to operate upon the compressed register field to 
produce a remaining portion of the decompressed register field. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Other objects and advantages of the invention will become apparent upon reading the following 
detailed description and upon reference to the accompanying drawings in which: 

Fig. 1 is a block diagram of one embodiment of a microprocessor. 

Fig. 2 is a block diagram of a second embodiment of a microprocessor. 



Fig. 3 A is a first insuuehon formal supported by one embodiment of the microprocessors shown m 
Figs. I and 2. 

Fig. 3B is a second instruction format supported by one embodiment of die microprocessors shown m 
Figs, i and 2. 

Fig. 3C is a third instruction format supported by one embodiment of the microprocessors shown in 
20 Figs. 1 and 2. 

Fig. 3D is a fourth instruction format supported by one embodiment of the microprocessors shown in 
Figs. 1 and 2. 

2 5 Fig. 4A is a fifth instruction format supported by one embodiment of .the microprocessors shown in 

Figs. 1 and 2. 

Fig. 4B is a sixth instruction format supported by one embodiment of the microprocessors shown in 
Figs. 1 and 2. 

30 . 

Fig. 4C is a seventh insouction format supported by one embodiment of the microprocessors shown in 

Figs. 1 and 2. 

Fig. 4D is an eight instruction format supported by one embodiment of the microprocessors shown in 
35 Figs. I and 2. 

Figs. 5 A, SB, 5C, 5D, and 5E are tables of exemplary instructions using the formats shown in Figs 
3A,3B, 3C,and3D. 

4 o Figs. 6 A, 6B, 6C, 6D. 6E, and 6F are tables of exemplary instructions using the formats shown in 

Figs. 4A t 4B t 4C, and 4D. 
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Fig. 7 is a diagram depicting offsets from an arbitrary register and a global pointer register, accor di ng 
to one embodiment of the microprocessors shown in Figs. 1 and 2, 

Fig. 8 is a block diagram of exemplary hardware for expanding an immediate field from a compressed 
5 instruction to a decompressed instruction. 

Fig. 9 is a diagram depicting decompressed offsets in accordance with one embodiment of the 
microprocessors shown in Figs. 1 and 2. 

10 Fig. 10 is a flow chart depicting operation of a decompressor for immediate fields according to one - 

embodiment of the microprocessors shown in Figs. 1 and 2. 

Fig. 1 1 is a block diagram of exemplary hardware for generating fetch ^rjrffs.w according to one 
embodiment of the microprocessors shown in Figs. 1 and 2. 

15 

Fig. 12 is a block diagram showing register decompression logic employed in one embodiment of the 
microprocessors shown in Figs. 1 and 2. 

Fig. 13 is a block diagram of an exemplary computer system including the microprocessor for which 
2 0 embodiments are shown in Figs. 1 and 2. 

While the invention is susceptible to various modifications and alternative forms, specific 
embodiments thereof are shown by way of example in the drawings and will herein be described in H^i| jt 
should be understood, however, that the drawings and detailed description thereto are not intended to limit the 
2 5 invention to the particular form disclosed, but on the contrary, the intention is to cover all modifications, 
equivalents and alternatives falling within the spirit and scope of the present invention as defined by the 
appended claims. 

DETAILED DESCRIPTION OF THE INVENTION" 

30 

Taming now to Fig. 1. a block diagram of a first embodiment of a microprocessor 10A is shown. 
Microprocessor 10A includes an instruction decompressor 12 A an instruction cache 14 A and a processor core 
16. Instruction decompressor 12A is coupled to receive instruction bytes from a main memory subsystem (not 
shown). Instruction decompressor 12A is further coupled to instruction cache I4A Instruction cache 14A is 

35 coupled to processor core 16. 

Generally speaking, microprocessor 10 A is configured to fetch compressed instructions from the main 
memory subsystem. The compressed instructions are passed through instruction decompressor 12A, which 
expands the compressed instructions into decompressed instructions for storage within instruction cache 14 A 
Many of the compressed instructions occupy fewer memory storage locations than the corresponding 

4 0 decompressed instructions, advantageously reducing the amount of memory required to store a particular 
program. Additionally, since instructions are decompressed within microprocessor 10 A the bandwidth 
required to transport the compressed instructions from the mam memory subsystem to microprocessor 10A is 

7 



reduced Microprocessor 10A may be employed within a computer sysxem having a relatively small main 
memory. Relatively large programs may be stored in the main memory due to the compression of instructions 
stored therein. 

In one embodiment microprocessor 10 A is configured to execute both compressed and non- 
compressed instructions on a routine-by-routine basis. In other words, a routine may be coded using either 
compressed instructions or non<ompressed instructions. Advantageously, routines which may not be 
efficient coded in the compressed instruction set may be coded using non-compressed instructions, while 
routines which are efficiently coded in the compressed instruction set are so coded Microprocessor 10A may 
soppon a particular decompression of the immediate field for load/store instructions using the global pointer 
register as a base register, in order to support mixing of compressed and non^ompressed instructions. The - 
particular decompression is detaiied further below. Additionally, a compression mode is detected by 
instruction decompressor 12A. The compression mode identifies the instruction set in which a murine is 
coded: co mpre ssed or non-compressed. 

Instruction compression is achieved in microprocessor 10A by imposing certain limitanons upon the 
available instruction encodings. By limidng the instruction encodings, instruction field sizes may be reduced 
(Le. the number of bits within an instruction field may be decreased}. For example, the number of available 
registers may be reduced to form the compressed instruction set Because fewer registers are available, a 
smaller field may be used to encode the registers used as source and destination operands for the instruction. 
Instruction decompressor 12A expands the encoded register field into a decompressed register field. The 
decompressed register field is included in the decompressed instruction. The compressed instructions use the 
reduced instruction fields, thereby occupying less memory (i.e. fewer bits) than the original instruction 
encodings defined by the microprocessor architecture employed by processor core 16. 

Instruction decompressor 12A is configured to accept compressed instructions and to decompress the 
instructions into the original instruction encodings. Each instruction field within a particular compressed 
instruction is expanded from the compressed field to a corresponding decompressed field within the 
corresponding decompressed instruction. The decompressed instruction is coded in the original instruction 
format supported by processor core 16. 

Processor core 16 includes circuitry for fetching instructions from instruction cache 14 A. decoding the 
instructions, and executing the instructions. The instructions supported by processor core 16 art specified by 
the microprocessor architecture employed therein. In one particular embodiment, processor core 16 employs 
the MIPS RISC architecture. However, it is understood that processor core 16 may employ any microprocessor 
architecture. Since instruction decompressor 12A decompresses instructions into the original instruction 
format, processor core 16 may comprise a previously designed processing core. In other words, the processing 
core may not require substantial modification to be included within microprocessor 10 A. 

The MIPS RISC architecture specifies an instruction set comprising 32 bit fixed-length instructions. 
A compressed instruction set is defined for microprocessor 10 A which comprises variable-length instructions. 
Many of the compressed instructions comprise 16-bit instructions. Other compressed instructions comprised 
32 bit instructions in conjunction with the extend instruction described below. Several 16-bit and 32-bit 
instruction formats are defined It is understood that, although 16-bit and 32 -bit compressed instructions are 
used in this embodiment, other embodiments may employ different .instruction lengths. The compressed 
instructions encode a subset of the no n -compressed instructions. Instruction encodings supported within the 
compressed instruction se: comprise many of. the most commonly coded instructions as well as the most often 



used registers, such that many programs, or routines within the programs, may be coded using the compressed 
instructions. 

In one embodiment, microprocessor 10A employs a compression mode. If the compression mode is 
active, then compressed instructions are being fetched and executed. Instruction decompressor 12A 
decompresses the instructions when they are transferred from mai^ memory to instruction cache 14. 
Alternatively, the compressed mode may be inactive. When the compression mode is inactive, non-compressed 
instructions are being fetched and executed. Instruction decompressor 12 A is bypassed when the compressed 
mode is inactive. In one particular embodiment, the compression mode is indicated by a bit within the fetch 
address (e.g. bit 0). The current fetch address may be stored in a PC register 13 within processor core 16. Bit 
0 of PC register 18 indicates the compression mode (CM) of microprocessor 10 A 

Instruction cache 14 A is a high speed cache memory configured to store decompressed and non- 
compressed instructions. Although any cache organization may be employed by insmiction cache 14 A, a set 
associative or direa mapped configuration may be suitable for the embodiment shown in Fig. 1. 

Turning next to Fig. 2, a second embodiment of a microprocessor iOB is shown. Microprocessor 10B 
includes an instruction cache 14B coupled to receive instruction bytes from the main memory subsystem, an 
instruction decompressor 12B, and processor core 16. Instruction cache 14B is coupled to instruction 
decompressor 12B, which is further coupled to processor core 16. 

Microprocessor 10B is configured with instruction decompressor 12B between instruction cache 14B 
and processor core 16. Instruction cache I4B stores the compressed instructions transferred from the main 
memory subsystem. In this manner, instruction cache 14B may store a relatively larger number of instructions 
than a similarly sized instruction cache employed as instruction cache 14 A in microprocessor 10 A. Instruction 
decompressor I2B receives fetch addresses corresponding to instruction fetch requests from processor core 16, 
and accesses instruction cache 14B in response to the fetch request. The corresponding compressed 
instructions are decompressed into decompressed instructions by insmiction decompressor I2B. The 
decompressed instructions are transmitted to processor core 16. 

Similar to microprocessor 10 A, microprocessor 10B includes a compression mode in one 
embodiment. Instruction decompressor 12B is bypassed when non-compressed instructions are being fetched 
and executed. For this embodiment, insmiction cache 14B stores both compressed and non-compressed 
instructions. It is noted that instruction cache 14B typically stores insmiction bytes in fixed-size storage 
locations referred to as cache lines. Therefore, a particular cache line may be storing compressed or non- 
compressed instructions. In either case, a plurality of instruction bytes are stored Therefore, instruction 
caches I4A and 14B may be of similar construction. The compression mode at the time a cache line is 
accessed determines whether the instruction bytes are interpreted as compressed or non-compressed 
instructions. 

An alternative configuration for microprocessor 10B is to include instruction decompressor 12B 
within the instruction decode logic of processor core 16. The compressed instructions may not actually be 
decompressed in such an embodiment. Instead, the compressed instructions may be decoded directly by the 
decode logic. The decoded instructions may be similar to the decoded instructions generated for the non- 
compressed instructions which correspond to the compressed instructions. 

It is noted that microprocessors 10A and 10B are merely exemplar/ embodiments of a microprocessor 
10 which operates upon compressed instructions. For the remainder of this discussion, microprocessor 10, 
instruction cache 14, and instruction decompressor 12 will be used to refer to the corresponding elements of 




both Figs. I and 2 as well as other embodiments of the elements included in other implementations of 
microprocessor 10. 

The terms deco mpr ession, compressed instruction, decompressed instruction, and non-compressed 
instruction are used in the above discussion and may further be used below. As used herein, the term 
5 "compressed instruction " refers to an insmiction which is stored in a compressed form in memory. The 

compressed instruction is generally stored using fewer bits than the number of bits used to store the instruction 
when represented as defined in the microprocessor architecture employed by. processor core 3 6 . The term 
"decompressed instruction" refers to the result of expanding a compressed instruction into the original 
encoding as defined in the nuanprocessor architecture employed by processor core 16. The term a non- 

1 o compressed instruction" refers to an instruction represented in the encoding denned by the microprocessor 
architecture employed by processor core 16. Non-compressed instructions are also stored in memory in the 
crm^ format (Le. non-compressed instructions were never co m p r e s sed). Finally, the term "decompression" 
refers to the process of expanding a compressed instruction into the corresponding decompressed instruction. 
It is noted that instniction decompressors 12 A and 12B may be configured to simultaneously 

15 decompress multiple compressed instructions. Such embodiments of instruction decompressors 12 maybe 
employed with embodiments of processor core 16 which execute multiple instructions per clock cycle. 

Figs. 3A-3D and 4A-4D depict exemplary instruction formats for 16-bit and 32-bit compressed 
instructions, respectively, according to one specific embodiment of microprocessor 10 employing the MIPS 
RISC architecture. Other instructions formats may be employed by other embodimems. The instruction 

20 formats shown in Figs. 3A-3D each comprise 16 bits in this particular implementation. Conversely, the 
instruction formats shown in Figs. 4A-4D each comprise 32 bits in this particular implementation. The 
compressed instructions encoded using the insmiction formats are decompressed into instruction formats as 
defined by the MIPS RISC architecture for each instruction. 

Fig. 3 A depicts a first instruction format 20. Instruction format 20 includes an opcode field 22, a first 

25 register field 24, a second register field 26, and a function field 28. Opcode field 22 is used to identify the 

instruction. Additionally, function field 28 is used in conjunction with certain particular encodings of opcode 
field 22 to identify the instruction. Effectively, function field 28 and opcode field 22 together form the opcode 
field for these instructions. When opcode field 22 employs certain other encodings than the particular 
encodings, function field 28 is used as an immediate fieid. 

3 o First register field 24 and second register field 26 identify destination and source registers for the 

instruction. The destination register is also typically used as a source register for the instruction. In this 
manner, two source operands and one destination operand are specified via first register field 24 and second 
register field 26. The notations "RT and M RS" in first register field 24 and second register field 26 indicate 
the use of the fields in the instniction tables below. Either RT or RS may be a destination register, depending 

3 5 upon the encoding of the instructioa 

In one embodiment, opcode field 22 comprises 5 bits, first register field 24 and second register field 
26 comprise 3 bits each, and function field 28 comprises 5 bits. First register field 24 is divided into two 
subfields (labeled RT1 and RTO). RT1 comprises two bits in the present embodiment, while RT0 comprises 
one bit. RT1 is concatenated with RTO to form first register field 24. Subfield RT1 and second register field 

4 0 26 are used in certain instructions encoded via instruction format 20 to indicate one of the 32 registers defined 

by the MIPS RISC architecture. 

Fig. 3B depicts a second instruction format 30. Instruction format 30 includes opcode field 22, first 



register field 24, and second register field 26. Additionally, a third register field 32 and a function field 34 arc 
shown. Third register field 32 is generally used to identify the destination register for iiistructions using 
instruction format 30. Therefore, first register field 24 and second register field 26 comprise source registers 
for instruction format 30. Function field 34 is used similar to function field 28. In the embodiment shown, 
5 third register field 32 comprises three bits and function field 34 comprises two bits. 

A third instruction format 40 is shown in Fig. 3C. Instruction format 40 includes opcode field 22 and 
second register field 26, as well as an immediate field 42. Immediate field 42 is used to provide imm~K*t* 
data for the instruction specified by instruction format 40. Immediate data is an operand of the instruction, 
similar to the value stored in a register specified by first register field 24 or second register field 26. For 
1 0 example, an add instruction which uses immediate data adds the immediate data to the value stored in the 

destination register, and stores the resulting sum into that destination register. In one embodiment, immediate 
field 42 comprises eight bits. Immediate field 42 is divided into two subfields (IMM1 and IMM0) in the 
instruction format shown in Fig. 3C The subfields allow second register field 26 to be placed in the bit 
positions within instruction format 40 as it is placed in instruction formats 20 and 30. Advantageously, second 
15 register field 26 is always found in the same position of 16-bit instructions in which it is used. Therefore. 

sufafield IMM1 comprises 2 bits and subfield IMM0 comprises 6 bits. IMM1 is concatenated with IMM0 to 
form the immediate value. 

Fig. 3D depicts a fourth instruction format 50. Instruction format 50 includes opcode field 22 and an 
immediate field 52. Immediate field 52, similar to immediate field 42, is used as an operand of the instruction. 
2C However, immediate field 52 comprises 1 1 bits. 

Fig. 4A depicts a fifth instruction format 60. Instruction format 60 includes opcode field 22, which is 
coded as the extend instruction. Instruction decompressor 12 recognizes the extend instruction opcode within 
opcode field 22 and treats the current instruction as a 32-bit instruction (i.e. the 16 bits mrh«W| ^ ^ 
instruction containing the extend opcode and the 16 bits which would otherwise comprise the next instruction 

2 5 in program order are concatenated to form a 32 bit instruction). Therefore, the compressed instruction can be 

seen to be a variable-length instruction set comprising 16-bit instructions and 3 2 -bit instructions. Instruction 
format 60 further includes a zero field 62 comprising six bits (coded to all binary zeros), an immediate field 
64, and a BR field 66. Instruction format 60 is used to code an extended form of the BR instruction (an 
unconditional branch instruction), and hence BR field 66 is an opcode field indicating the BR instruction. In 

3 0 one embodiment, the BR opcode is hexadecimal 02. 

The extended BR instruction has a larger immediate field than the non-extended BR instruction, and 
therefore may be coded with larger offsets than the non-extended BR instruction. When a branch to an 
instruction distant from the branch instruction is desired, the extended BR instruction may be used. 
Alternatively, branches to close instructions may use the non-extended BR instruction. Immediate field 64 
3 5 comprises 16 bits which are used as an offset to be added to the address of the instruction following the BR 
instruction to create the target address of the branch instruction. The non-extended BR instruction, by 
contrast, includes an eleven bit offset (i.e. it is coded using instruction format 50). 

Fig. 4B depicts an instruction format 70 which is an extended version of insiruction format 40. 
Instruction format 70 includes opcode field 22 coded as the extend opcode, as well as an immediate field 72, a 
first register field 74. a second register field 76, and a second opcode field 78. First register field 74 and 
second register field 76 comprise five bits each in the embodiment shown. Tnerefore, any register defined by 
the MIPS RISC architecture may be accessed using instruction formal 70. Second opcode field 78 defines the 
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uistmcrioa being executed. ^ comprises 5 bits (similar to opcode field 22). Finally, immediate field 
comprises 12 bits divided into a one bit IMM2 subfield. a five bit IMM1 subfield. and a si* bit IMMO suoncld. 
immediate field 72 is formed by concatenating IMM2 with IMM1 and further with IMMO in the emoodimenr 
shown. 

An ext-nded instruction format corresponding to instruction format 30 is shown m Fig. 4C as an 
instruction format 80. Instruction format 80 includes opcode field 22, first register field 74, second regret 
field 76 and second opcode field 78, similar to instruction format 70. Additionally, mstrucaon format 80 
includes a third register field 82 and a function field 84. Third register field 82 is similar to third register field 
32 except that third register field 82 comprises five bits. Therefore, any MIPS RISC architecture regoster may 
be specified by third register fields. Function field 84 is similar to function fields 28 and 34, except that 

function field 84 comprises six bits. 

Second opcode field 78 is coded to a particular value to identify instruction format 80 from instruction 
format 70 When second opcode field 78 is coded to the particular value, instruction format 80 is assumed by 
instruction decompressor 12. Conversely, when second opcode field 78 is coded to a value other than the 
particular value, instruction format 70 is assumed by instruction decompressor 12. In one embodiment, the . 
particular value comprises hexadecimal 00. 

Instruction format 80 further includes a COP0 bit 86. COP0 bit 86. when set, indicates that certain 
coprocessor zero instructions (as defined in the MIPS RISC architecture) are being executed The tables of 
instructions below further define the instructions encoded by setting COP0 bit 86. 

The instructions defined for instruction formats 20. 30. 40, and 50 are capable of performing many of 
the operations commonlv performed in typical programs. However, routines may need to perform operations 
of which these instructions are incapable. While most of the instructions in the routine may be coded using 
instruction formats 20-50. several instructions may require additional encodings. For example, access to a 
register not included within the subset of available registers in formats 20-50 may be needed. Additional 
instructions not included in the instructions encoded using formats 20-50 may be needed. For these and other 
reasons the extend opcode and extended instruction formats 60-80 are defined. 

' Instruction decompressor 12 examines opcode field 22 in order to detect the extend opcode. The 
extend opcode is one of the opcodes defined to use instruction format 50 in the present embodiment, although . 
the bits included in immediate field 52 are assigned differing interpretations depending upon the extended 
instroction format coded for the particular extended instruction. The extended instruction formats include a 
second opcode field (e.g. fields 66 and 78) which identify the particular extended mstrucaon. 

Addition of the extend opcode and extended instruction formats allows for many instructions to be 
encoded using the narrower instruction formats 20-50. but still have the flexibility of the wider extended 
instruction formats when desired. Programs which occasionally make use of the functionality included in the 
extended instruction formats may still achieve a reduced memory footprint, since these programs may be 
encoded using compressed instructions and many of the compressed instructions may comprise 16-bn 

compressed instructions. 

An embodiment of microprocessor 10 may handle the extended instructions by fetching 16-bit 
instruction portions and detecting the extend opcode. When the extend opcode is detected, a NOP maybe 
transmitted to processor core 16 and the remaining 16-bit portion of the extended instruction may be fetched. 
The extended instruction is decompressed and provided as the next instruction after the NOP. 

Additionally.' instruction decompressor 12 handles cases wherein a portion of the extended .nsmruon 




is available while a second portion is unavailable. For example, two portions of the extended instruction may 
lie within two distinct cache lines within instruction cache 14. Therefore, one portion of the instruction may 
be fetched from instruction cache 14 while the other portion may not reside within instruction cache 14. The 
portion may then need to be stored within instruction decompressor 12 until the remaining portion is available 
5 Finally, Fig. 4D is an instruction format 90 used to explicitly expand the JAL instruction of the MIPS 

RISC instruction set. Tat JAL instruction is often used as a subroutine call instruction. Subroutines may be 
stored in memory at a great distance (address-wise) from the calling murine. Therefore, having the largest 
possible range of relative offsets (via an immediate field 92 comprising 26 bits) is important for the JAL 
- instrucuoo. Additionally, an exchange bit 94 is included in the instruction encoding. The exchange bit is used 

10 to indicate the compressed/non-compressed nature of the instructions at the target address. If the bit is set, the 
target instructions are compressed instructions. If the bit is clear, the target instructions are non-compressed 
instructions. The value of exchange bit 94 is copied into bit 0 of the program counter within processor core 16. 
Bit 0 of the program counter may always be assumed to be zero, since the sixteen bit and thirty-two bit 
instructions occupy at least two bytes each and instructions are stored at aligned addresses. Therefore, bit zero 

15 is a useful location for storing the compression mode of the current routine. Processor core 16 increments 
fetch addresses by 2 (instead of 4) when bit 0 is set. thereby fetching 16 bit compressed instructions through 
instruction decompressor 12. 

Each instruction within the compressed instruction set employed by microprocessor 10 uses at least 
one of the instruction formats shown in Figs. 3A-3D and Figs. 4A-4D. It is noted that opcode field 22 is 

2 0 included in each instruction format, and is located in the same place within each instruction format. The 
coding of opcode field 22 determines which instruction format is used to interpret the remainder of the 
instruction. A first portion of the opcode field encodings is assigned to instruction formal 20; a second portion 
of the opcode field encodings is assigned to instruction format 30; etc. 

As used herein, the term "instruction field" refers to one or more bits within an instruction which are 

2 5 grouped and assigned an interpretation as a group. For example, opcode field 22 is comprises a group of bits 

which are interpreted as the opcode of the instruction. Additionally, first and second register fields 24 and 26 
comprise register identifiers which identify a storage location within processor cere 16 which store operands of 
the instruction. Additionally, the term immediate field refers to an instruction field in which immediate data is 
coded, immediate data may provide an operand for an instruction. Alternarrvery. immediate data may be used 
30 as an offset to be added to a register value, thereby producing an address. Still further, immediate data may be 
used as an offset for a branch instruction. 

Figs. 5 A-6F are tables listing an exemplary compressed instruction se: for use by one particular 
implementation of microprocessor 10. The particular implementation employs the MPS RISC architecture 
within processor core 16. Therefore, the instruction mnemonics listed in an instruction column 100 of the 

3 5 tables correspond to instruction mnemonics defined in the MIPS RISC architecture (or defined for the 

instruction assembler, as described in "MIPS RISC Architecture" by Kane and Hsinrich. Appendix D, Prentice 
Hall PTR. Upper Saddle River, New Jersey, 1992, incorporated herein by reference) with the following 
exceptions: CMPI, MOVEI, MOVE, NEG, NOT. and extend. These instructions translate to the following 
MIPS instructions (RS and RT refer to the 16-bit RS and RT); 

4 0 

CMPI XORI S24. RS. imm8 

MOVI ADDrU RS, SO. simrnS 
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MOV ADD. RS.tt.RT 

KEG SUB RS, $0, RT 

NOT NOR RS, $0, RT 

extend (described above) 

Additionally, the instruction tables use several symbols. In an operands column 102, the symbols rs, 
it xs, xi, and ni are used Rs and xs refer to second register field 26 (or second register field 76), while rt and 
xi refer to first register field 24 (or first register field 74). Similarly, rd refers to third instruction field 32 (or 
third instruction field 82). As mentioned for one embodiment above, first register field 24, second register 
field 26, and third register field 3 2* comprise three bits each. ' Table 1 below lists the mapping of the field - 
encodings (listed in binary) to registers in the MIPS RISC architecture for these symbols. Other mappings are 
also contemplated, as shown further below. Names assigned according to MIPS assembler convention are also 
listed in Table 1 . 
Table 1: Register Mappings 



Field Encodine 


R5. RT, Rp 


XS. XT 


000 


$8(t0) 


$24 (t8) 


001 


SI (at) 


$17 (si) 


010 


$2(v0) 


$18 (s2) 


011 


$3 (vl) 


$19 (S3) 


100 


$4(a0) 


$28 (gp) 


101 


$5 (al) 


$29 (sp) 


110 


$6(a2) 


$30 (s8) 


111 


$7(a3) 


$31 (ta) 



As shown in table 1. up to 16 registers are available for use in compressed instructions having registers fields 
24, 26, or 32. Because each register field is three bits, only eighi registers are available for a given opcode, 
instructions which may access all sixteen registers are assigned cwo opcodes in the instruction tables below. 
Register selection is thereby a function of both a register field and opcode field 22. Advantageously, register 
fields may be encoded using fewer bits while still providing select instructions which may access a large group 
of registers. 

Also listed in operands column 102 are symbols for the imm ediate fields 32, 42, 64, and 72. The 
symbol w imm" indicates an immediate field is included. If "imnT is preceded by an V, the immediate field is 
signed and the decompression of the immediate field into the decompressed instruction is performed by sign 
extending the immediate field. If "imm" is not preceded by an m S n , the immediate field is unsigned and 
immediate field decompression involves zero extending the immediate field. In one embodiment, immediate 
field decompression for load/store instructions comprises right rotation of the immediate bits by one bit for 
halfwords and two bits for words, followed by shifting of the immediate bits left by one bit for halfwords and 
two bits for words. Effectively, a seven bit immediate field is provided for words and a six bit immediate field 
for halfwords (in the 16-bit instruction formats). The MIPS RISC architecture defines that data addresses 
corresponding to load/store instructions are aligned for each instruction included in the exemplary compressed 
instruction set. Therefore, the least significant bit (for halfwords) and the second least significant bit (for 
words) may be set to zero. Bits in the compressed immediate field need not be used to specify these bits. 
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Finally, a imm" is post-fixed with a number indicating the number of bits included in the immediate field- 
Opcode field 22 and function field 28 are decompressed as welL More particularly, opcode field 22 
and function field 28 identify the instruction within the MIPS RISC architecture, in accordance with the tables 
shown in Figs. 5A-6T. The opcode and function fields of the decompressed instructions are coded in 
accordant with the MIPS RISC architecture definition. 

Figs. 5A and SB depict a table 1 10 and a table 1 12, respectively. Tables 1 10 and 1 12 list instructions 
from the exemplar}' compressed instruction set which use instruction format 20 shown in Fig. 3 A. Instruction 
column 100 and operands column 102 are included, as well as an opcode column 106 and a function column 
104. Opcode column 104 and function column 106 include hexadecimal numbers, and correspond to opcode 
field 22 and function field 28, respectively. 

Table 1 10 includes several instructions which have an m imm5 n coding in function column 104. The 
•imxnf" coding appears for the load/store instructions within table 1 10, and indicates that function field 28 is 
used as an immediate field for these instructions. For other instructions, function field 28 is used in 
conjunction with opcode field 22 to identify a particular instruction within the compressed instruction set 
Additionally, opcode Id is labeled as special in table 110. The special instructions have a sped&c 
interpretation of function field 28. In particular, if the most significant bit of the function field is clear, then 
the instruction is defined to be: 

ADDIU rt, rs, simm4 

wherein the "simnvT operand is formed from the remaining bits of function field 28. If the most significant 
bit of function field 28 is set, the instruction is H^firvH to be: 



ADDIU xt. xs, simm4 

except for two special cases. If second register field 26 is coded to a zero, then the instruction is: 



MOVEI xt, imra4 



wherein again the imm4 operand is formed from the remainder of function field 28. Lastly, if second register 
field 26 is coded to 5 (hexadecimal), then the instruction is defined to be: 

ADDIU sp, sumn9 

wherein the simm9 operand is formed from the remaining bits of function field 28 and first register field 24. 
The low order two bits of the simm9 operand are set to zero. 

It is noted that the destination of the SLT and SLTU insrrucrions shown in table 1 10 is the t$ register 
(register S24) according to one embodiment 

Table 1 12 shows an "immT and "imm6" operand for several instructions. The imm3 operand is 
coded into second register field 26, and the "imm6" operand is coded into both second register field 26 and 
first register field 24. 

Additionally, table 1 12 includes the jump register (JR) instruction, having second register field 26 as 
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an operand However, it is noted that in one embodiment subfield RT1 of first register field 24 is used in 
conjunction with second register field 26 10 specify any of the MIPS RISC architecture registers for the JR 
instruction. 

Turning now to Fig. 5C, a table 114 including instruction column 100, operands column 102, opcode 
column 106. and function column 104. Table 1 14 lists instructions from the exemplary instruction set which 
use instruction format 30 shown in Fig. 3B. Certain instructions within table 1 14 have hardcoded destination 
registers (Le. the destination registers cannot be selected by the programmer, other than by using a different 
opcode). For these instrucdons. third register field 32 is combined with function field 34 to store the function 
field encoding shown in function column 104. Additionally, an instruction is shown which has an immediate 
operand in function coiumn 104 and operands column 102. This instruction uses second register field 26 in - 
conjunction with function field 34 to code the corresponding immediate field used by the instruction. 

Figs 5D and 5E are tables 1 16 and 118 showing the instructions from the exemplary compressed 
instruction set which employ mstruction fomats 40 and 50, raqjectivery. It is noted that the extend instruction 
is shown in table 118. However, the extend instruction actually indicates that the instruction is a 32-bit 
compressed infraction which uses one of instruction formats 60, 70, or 80. 

Turning now to Figs. 6A and 6B. a table 120 and a table 122 are shown. Tables 120 and 122 depict 
those instrucdons from the exemplary compressed instruction set which are encoded using instruction format 
70, shown in Fig. 4B. 

Table 120 includes instruction column 100 and operands column 102. and further includes an opcode 
column 108. Opcode column 108 is similar to opcode column 106. except that the opcode encodings shown in 
opcode column 108 correspond to opcode field 78. 

Table 122 includes an RT column 109 which corresponds to first register field 74. The coding of the 
RT field in the instructions shown in table 122 indicates which instruction is selected. The instructions shown 
in table 1 22 share a specific encoding in opcode field 78. In one embodiment, the specific encoding u 00 
(hexadecimal). 

Figs. 6C, 6D. 6E. and 6F are tables 124. 126, 128. and 130 which depict instructions from the 
exemplar.- compressed instruction set which are encoded according to instruction format 80. Tables 124, 126, 
and 130 include a function column 107 which corresponds to encodings of function field 84. Table 128 
includes an RS. RT column 105 which will be explained in more detail below. 

Operands column 102 for table 124 includes immediate operands for certain instructions. The 
•imm5" operand is coded into second register field 76. The "imml5" operand is coded into a combination of 
first register field 74, second register field 76, and third register field 82. 

The instructions listed in table 128 arc identified via encodings of second register field 76, as shown 
in RS, RT column 105. Certain instructions are identified via second register field 76 in conjunction with first 
register field 74. Those instructions for which RS. RT column 105 includes an asterisk for the RT portion art 
identified via second register field 76, while those instructions for which RS. RT column 105 does not include 
an asterisk axe identified by second register field 76 in conjunction with first register field 74. Instructions 
which are not identified via first register field 74 may use first register field 74 to encode an operand. The 
instructions listed in tables 128 and 130 are instructions for which COPO bit 86 is set, while instructions listed 
in tables 124 and 126 are encoded with COPO bit 86 clear. 

Certain instructions in table 128 include an "imm6" operand. The "imm6" operand is coded into 
function field 84. Additionally, function field 84 is used to indicate the instructions shown in table 130 when 
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second rcgisicr field 76 is coded to Ix (hexadecimal), wherein "x H indicates that the low order bits are dont 
cared. 

Turning now to Fig. 7, a first addressing window 150 and a second addressing window 152 are shown 
according to one embodiment of microprocessor 10. At the center of addressing window 150 is the value of a 
base register (represented as Reg. on the left side of addressing window 150). The value of the base register 
identifies an address within the main memory subsystem. Addressing window 150 represents the range of 
addresses around the value of the base register which arc accessible to a load/store instruction in the non- 
compressed instruction set according to one embodiment of the non-compressed instruction set. The non- 
compressed instruction set specifies that load/store instructions form the aHrir^ 0 f a memory operand via the 
Aim of a value stored in a base register and a sixteen bit signed immediate field In such an embodiment, the- 
range of addresses has an upper boundary of 32767 greater than the base register and a lower boundary of 
3276S less than the base register. Other embodiments may include larger or smaller ranges. As used herein, 
the term "base register" refers to a register which is specified by a load/store instruction as storing a base 
address, to which the signed immediate field is added to form the address of the memory operand operated 
upon by the instruction. 

As shown in table i 10, for example, load/store instructions within the 16-bit portion of the exemplary 
compressed instruction set include a five bit immediate field This field is rotated right two bits and then 
shifted left two bits for word-sized memory operands, forming a seven bit immediate field (the largest of the 
immediate fields which may be formed using the five bits, according to one embodiment). The seven bit 
immediate field is then zero extended to form a positive offset from the base register in the corresponding 
decompressed instruction. A subrange 154 of addresses are therefore available for access by compressed 
instructions. Within addressing window 150, subrange 154 has an upper boundary of 127 greater rhan the 
base register and a lower boundary of the base register. However, subrange 154 may vary in size from 
embodiment to embodiment. 

While subrange 154 may work well for many load/store instructions, a different subrange may be 
employed for use with the global pointer register. The global pointer register is a register assigned by software 
convention to locate an area of memory used for stonng global variables. A global variable is a variable which 
is available for access from any routine within a program. In contrast, a local variable is typically accessible 
only to a particular routine or group of routines. In the MIPS instruction set. for example, register S2S is often 
used as the global pointer register. 

The area of memory around the global pointer register may therefore be viewed as a table of global 
variables. Each global variable is assigned an offset within the table. The offset corresponds to a particular 
immediate field value which may be added to the global pointer register in order to locate the global variable. 
For the embodiment shown in Fig. 7, for example, a 64 kilobyte table may be allocated for global variables as 
shown along the left side of addressing windows 150 and 152. 

If compressed immediate fields are decompressed as described for addressing window 150. then the 
global variable table includes a section which is accessible to compressed instructions (corresponding to 
subrange 154) which is between two subranges 156 and 158 accessible to non^ompresscd instructions. As 
noted above, microprocessor 10 may support programs in which some routines are coded with non -compressed 
instructions while other routines are coded with compressed instructions. Allocating global variables in a 
particular program is complicated by the division of the non^ompressed global variable subranges 156 and 
15$ of addressing window 150. Global variables may be allocated into subrange 158. for example, and then 
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global variable allocation must conrinue in subrange 156 (for non-compressed instructions). In other words, 
subrange 154 must be bypassed for global variables accessible to non-compressed instructions. 

Microprocessor 10 may employ a decompression of the compressed immediate field for load/ store 
instructions using the global pointer (GP) register which leads to addressing window 152. Addressing window 
152 includes a subrange 160 accessible to compressed instructions and a subrange 162 accessible to non- 
compressed instructions. Advantageously, subrange 162 is a contiguous block of m emo ry. Global variables for 
a cr ^ 95 by non-compressed instructions may be allocated into subrange 162, while global variables for access by 
compressed instructions may be allocated into subrange 160. Essentially, subrange 160 and subrange 162 form 
distinct tables of global variables for access by compressed and non-compressed instructions, respectively. 

Addressing window 1 52' is achieved by decompressing the compressed immediate field as described- 
above, except that the most significant bit of the decompressed immediate field is set If the compressed 
fmTn fWiatff field is coded with binary zeros, then the decompressed immediate field is S000 (in hexadecimal). 
Since the decompressed immediate field is interpreted as a signed field for load/store instructions, the 8000 
value is the most negative number available in the decompressed immediate field. Other encodings of the 
compressed immediate field are decompressed into negative numbers which form subrange 160. Subrange 160 
forms the lower boundary of the range of addresses represented by addressing window 152 as shown in the 
embodiment of Fig. 7. 

As used herein, the term memory operand refers to a value stored in a memory location within the 
main memory subsystem. Load instructions may be used to transfer the memory operand to a register within 
microprocessor 10. Conversely, store instructions may be used to transfer a value stored in a register into the 
memory operand storage location. A memory operand may be of various sizes (i.e. numbers ofbytes). In one 
embodiment, three sizes are available: byte, halfword, and wor± A halfword comprises two bytes, and a word 
comprises four bytes. Other memory operand sizes are contemplated for other embodiments. 

Turning to Fig. 8, a block diagram of exemplary hardware within instruction decompressor 12 for 
decompressing the immediate field of a load/store instruction is shown. It is noted that multiple copies of the 
exemplary hardware shown in Fig. S may be employed to concurrently decompress multiple load/store 
instructions. The exemplary hardware shown in Fig. S is described in terms of microprocessor 10B. However, 
similar hardware may be employed within microprocessor 10 A. Tne exemplary hardware includes a 
immediate field decompressor 170 and a register decoder 172. 

When an instruction is conveyed to instruction decompressor 12B from instruction cache UB, a 
portion of the instruction comprising the compressed immediate field for load/store instructions is conveyed to 
immediate field decompressor 170 upon a compressed immediate bus 174. For the exemplary instruction set 
described in Figs. 3A-6F, the compressed immediate field comprises function field 28 (shown in Fig. 3 A). 
Additionally, the base register field for the compressed load/store instruction is conveyed upon a base register 
bus 176. For the exemplary instruction set shown in Figs. 3 A-6F, the base register field comprises second 
register field 26. 

Register decoder 172 decodes the register identified upon base register bus 176. If the bass register is 
the global pointer register, register decoder 172 asserts a GP signal upon GP line 178 to immediate field 
decompressor 170. Otherwise, register decoder 172 deassens the CP signal. 

Immediate field decompressor 170 decompresses the compressed immediate field in one of two ways, 
dependent upon the G? signal. If the GP signal is deasserted. then immediate field decompressor 170 clears 
the most significant bit of the decompressed immediate field. Conversely, immediate field decompressor 170 
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• sets the most significant bit of the immediate field if the GP signal is asserted Therefore, a positive offcet is 
created when a register other than the global pointer register is used as the base register. A negative oftet is 
created when the global pointer register is used as the base register. Immediate field decompressor 170 

- conveys the decompressed immediate field upon a decompressed immediate bus i80. 

Fig. 9 illustrates the decompressed immediate field generated for load/store instructions according to 
one embodiment of the exemplary compressed instruction set. The compressed immediate field of load/store 
instructions which do not employ the global pointer register as the base register are decompressed as indir ^ 
by reference number 182. The decompression for bytes, halfwords, and words are shown separately, with each 

- bit position of the decompressed immediate field (or offset) represented by a numerical di^i or an "L\ Bits 
from the compressed immediate field art shown in the respective bit locations of the decompressed field via the 
numerical digits. The least significant bit of the compressed immediate field is represented by the digit 0, and 
the most significant bit of the compressed immediate field is represented by a 4. The letter T" is used to 
indicate a bit position which is set to a binary zero. 

Decompressed immediate fields corresponding to bytes, halfwords, and words for load/store 
instructions which use the global pointer register as a base register are indicated by reference number 184. 
Similar to the decompressed fields indicated by reference number 182, the decompressed fields indicated by 
reference number 184 depict numerals in bit positions which are filled with a bit from the compressed 
immediate field and the letter T" is used to indicate a bit position which is set to a binary zero. Additionally, 
the most significant bit of each decompressed offset is set toia binary one (indicated by the letter TT)- 

Turning next to Fig. 10. a flow chart is shown depicting activities performed by instruction 
decompressor 12 in order to decompress instructions in accordance with the embodiment shown in Fig. 8. 
Although the steps shown in Fig. 10 are illustrated as serial in nature, it is understood that various steps may 
be performed in parallel. 

Instruction decompressor 12 determines if a received instruction is a load/store instruction (decision 
block 190). If the instruction is not a load/store instruction, the instruction is expanded in accordance with a 
mapping between the compressed instructions (as illustrated in Figs. 3AhSF) and the corresponding 
decompressed instructions (step 192). If the instruction is a load/store instruction, then the base register 
specified by the instruction is examined (decision block 1 96). If the base register is the global pointer register, 
the immediate field is decompressed as indicated by reference number 184 in Fig 9 (step 194). Alternatively, 
if the base register is not the global pointer register, the immediate field is decompressed as indicated by 
reference number 182 in Fig. 9 (step 192), 

In addition to decompressing load/store offsets in a different manner for the global pointer register, 
microprocessor 10 also supports a compression mode for indicating which type of instructions are being 
execmed by microprocessor 10 (i.e. compressed or non-compressed). Fig. 1 1 is a block diagram niustrating a 
portion of one embodiment of instruction decompressor 12. The illustrated portion determines the 
compression mode for each routine executed by microprocessor 10. The portion shown may be suitable for 
microprocessor 1 0B, and a similar portion may be employed by microprocessor 10A Fig. 1 1 depicts a mode 
detector 200. 

When an instruction is fetched by processor core 1 6. the instruction is received upon an instruction 
bus 202 by mode detector 200. Mode detector 200 detects when the jump and link f JAL) instruction is fetched, 
and further examines the exchange bit 94. If exchange bit 94 ts set, the routine at the targe: address of the JAL 
instrucnon comprises compressed instructions. Therefore, the compression mode of the target routine is 
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compressed. Alternatively, exchange bit 94 may be clear, in this case, the compression mode of the target 

routine is uncompressed. 

In addition to specifying the compression mode for the target routine, the JAL instruction causes the 
address of the instruction following the JAL instruction to be stored into register S3 1 of the MIPS RISC 
5 architecture. This register may subsequently be used with the JR instruction to return from the target routine. 
Because the compression mode is stored as pan of the address in this embodiment, the compression mode of 
the source routine is restored upon execution of the JR instruction. Advantageously, routines encoded in 
compressed instructions may be intermixed with routines encoded in non-compressed instructions. The new 
compression mode is conveyed to processor core 16 upon a compression mode line 206. It is noted that mode 

10 detector 200 may be included as a pan of processor core 16 instead of instruction decompressor 12, in 
alternative embodiments. 

The embodiment of mode detector 200 shown in Fig. 1 1 includes a storage 204 for a compression 
enable bit. If compression is enabled, the compression enable bit is set. When instructions are fetched in 
compressed mode and co mpres sion is enabled, instruction decompressor 12 decompresses the instructions. If 

15 the enable bit is clear, instruction compression is disabled for microprocessor 10, Instruction decompressor 12 
is bypassed when insmiction decompression is disabled. Furthermore, mode detector 200 mdicates that the 
compression mode is non-compressed when instruction compression is disabled. 

As used herein, a routine is an ordered set of instructions coded for execution by microprocessor 10. 
The routine may be coded in either compressed or non-compressed instructions, and is delimited by a 

20 subroutine call instruction and a return instruction The delimiting subroutine call instruction is not included 
within the routine. Instead, the subroutine call instruction indicates the beginning of the routine via a target 
a^/fr^cc included with the subroutine call instruction. The first instruction of the routine is stored at the target 
address. Additionally, the adders of an instruction within the routine including the subroutine call instruction 
is saved so that a return instruction may be executed to return to the calling routine. In the exemplary 

2 5 compressed instruction set depicted in Figs. 3 A-6F, the jal instruction may serve as a subroutine call 

instruction. Alternatively, the jalr instruction may serve as a subroutine call instruction 

A routine ends with a return instruction, which causes subsequent instruction execution to return to 
the address saved when the corresponding subroutine call instruction is executed. In other words, the target 
address of the return instruction is the saved address. For the exemplary compressed instruction set, the jr 

3 0 instruction may serve as a return instruction Generally speaking, a target address is an address at which 

instruction fetching is to begin upon execution of the instruction corresponding to the target address. 

Turning next to Fig. 12, a block diagram of one embodiment of register field decompression is shown. 
Other embodiments of register field decompression are contemplated. The compressed register field 
corresponding to an instruction is conveyed upon compressed register field bus 210. A register decompressor 

3 5 block 2 1 2 receives the compressed register field. Additionally, at least a portion of the compressed register 

field is incorporated into the decompressed register field which is then conveyed upon decompressed register 
field bus 2U. The decompressed register field is thereby formed by concatenating at least a 
portion of the compressed register field to the value generated by register decompressor block 212. 

In one embodiment, the entire compressed register field is concatenated into the decompressed 

4 0 register field. Additionally, the remaining portion of the decompressed register field depends upon which 

register set the instruction accesses (e.g. xs vs. rs and xi vs. n). A set selector signal is received upon set 
selector bus 216 for each register, indicating whether the xs (Kt) or the rs (rt) register set should be used. If the 
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set selector signal is asserted, then xs (xt) is selected. Otherwise, rs (rt) is selected The set selector signal is 
asserted or Reasserted based upon the opcode of the instruction being decompressed in accordance with the 
exemplary compressed instruction set shown in Figs. 5A-6F. For example, the register mapping between 
compressed and decompressed registers shown in Table 1 may be employed For such an example, register 
decompressor 212 may employ the following logic, wherein DR represents the decompressed register field, CR 
represents the compressed register field, and RH represents the corresponding set selector signal value: 

DR(4:3] = {RH, (RH & CR[2] | !CR(2:0I)} 

Several other register mappings are contemplated examples of which are shown in tables 2-4 below, 
along with corresponding Verilog logic equations. It is noted that any register mapping may be employed by 
various embodiments of microprocessor 10. 
Table 2: Second Exemplary Register Mappings 



Field Encoding 


RS. RT. RD 


XS. XT 


000 


SS(tO) 


$24 (t8) 


001 


S9 (tl) 


$25 (t9) 


010 


$2 (vO) 


$18 (s2) 


on 


S3 (vl) 


$19 (s3) 


100 


$4(a0) 


$28 (£p) 


101 


S5 (al) 


$29 (sp) 


110 


$6(a2) 


$30 (s8) 


111 


$7 (a3) 


$31 (ra) 



DR{4:3] = {RH. (RH & CR[2] | !CR{2:I])) 
Table 3: Third Exemplary Register Mappings 



Field Encoding 


RS. RT. RD 


*S.XT 


000 


$16 (sO) 


$24 (t8) 


001 


SI (at) 


S9 (tl) 


010 


S2 (vO) 


S10 (t2) 


on 


S3 (vl) 


Sll (t3) 


100 


S4 (aO) 


S28 (gp) 


101 


S5 (al) 


S29 (sp) 


110 


S6 (a2) 


S30 (s8) 


ill 


S7(a3) 


S31(ra) 



DR(4:3] = {(RH & CR{2] i !CR[2:0]), RH} 



Table 4: Fourth Exemplary Register Mappings 



Field Encoding 


RS. RT. RD 


XS. XT 


000 


S16 (sO) 


S24 (t8) 


001 


$17 (si) 


S25 (t9) 


010 


$2 (vO) 


$10 (t2) 


on 


$3 (vl) 


Sll (t3) 
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100 $4 (a0) S28 (gp) 

101 S5(al) $29 (sp) 
HO $6(a2) $30 (s«) 
HI $7(a3) $31 (ra) 

DR(4:3] = <(RH & CRT21 1 ! CR(2:1]). RH} 

As indicated by the assembler assigned names shown in tables 1-4. various registers are assigned to 
various functions by software convention. For example, the MIPS assembler assigns the following meanings to 
registers: 



table 5: Software Convention for Register Names 


£eeister 


Software Name 




SO 


none 


Hardwired to zero 


SI 


Sal 


used bv assembler 


S2..S3 


vC-vl 


Function results or static link 


S4..S7 


aO-a3 


arguments for a subroutine 


SS..S15. 


tO-t9 


Temporary registers, not saved berween subroutine calls 


S24..S25 






S16..S23, S30 


sO-sS 


Saved between subroutine calls 


S26..S27 


kO-kl 


Reserved for operating system 


$28 


SP 


Global Pointer 


S29 


sp 


Stack Pointer 


S31 


ra 


Return address 



It is desirable to provide access to both temporary and saved registers to routines coded in compressed 
instructions. Additionally, access to vO-vl . a0-a3. gp, sp, and ra are needed to operate with existing software. 
The register mappings shown balance these qualities with the desire for register decompressor 212 to occupy a 
fcirty small number of gates. Advantageously, a useful set of registers is selected from the MIPS register set 
while still maintaining a low gate count within register decompressor 212. 

Turning now to Fig. 13. an exemplary computer system 220 including microprocessor 10 is shown. 
Many other computer systems employing microprocessor 10 are contemplated. Within computer system 220. 
rmcroprocessor 10 is incorporated onto a semiconductor substrate 224 along with multiple I/O interfaces 222A- 
222N. The- I/O interfaces interface to I/O devices external to substrate 224. An exemplary I/O interface 222A 
may be a universal asynchronous recervex/aansnutter (UART). 

Microprocessor 10 may be coupled to I/O interfaces 222 for communication therewith. Additionally, 
microprocessor 10 may be coupled to external interface logic 226. which further interfaces to one or more 
dynamic random access memory (DRAM) modules 228. DRAM modules 228 may store compressed and/or 
non-compressed instruction code, as well as data for used by the program represented by the compressed and/or 
non-compressed instrucuon code. 

It is noted that the present discussion may refer to the assertion of various signals. As used herein, a 
signal is •asserted" if it conveys a value indicative of a particular condition. Conversely, a signal is 
"deassened" if it conveys a value indicative of a lack of a particular condition. A signal may be denned to be 
asserted when it conveys a logical zero value or, conversely, when it conveys a logical one value. 



Although a specific example of a compressed instruction set is shown and described herein, multiple 
variations, extensions, and modifications may be made to the exemplary compressed instruction set. These 
variations, extensions, and modifications are contemplated. 

The following Verilog listing describes exemplary logic for instruction decompressor 12. Many 
different embodiments of the logic are contemplated, although the Verilog listing shown is one suitable 
example: 



'tzroescaic I ns / 1 ns 

module tinyrisc_dp( xo, ci, dojal, ext, x ); 



'output{31:0] xo; 



input[15:0] 
input 

tTyp ut 

input(10:0] 



a; 

dojal; 

ext; 

x; 



// expanded instruction out 

// compressed instruction in 



//dojal 



//extend 
//extend bits 



wire ext Jal = ext | dojal ; 



/* ci[l< 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 

wire 



decodes •/ 
xsp = cifl5:llj = 51)00000; 
spany » -ext Jal & ci[15] <fc -<i[14] & -d[l I] ; 
splor2 = spany &-d[13] ; 
special = splorl & -ci[12]; 
brjal = d[15:12]=~ 41)0001 ; 
br = bijal ; 

word ci[15] & (-ext | -d[13]) & ci(12] & ci[l 1 J 
speciaD= -ext Jal & d[15:ll] — 51)11101 



half 
opi 
opx 
sll 
rtxO 
snx 

special? 

rirnm 

zxh 

xlz 

x4z 

xn 

ill 

rsza 



/* Sl decodes */ 

wire jr 

wire jair 

wire negnot 

wire rseqO 

wire sp2x 

wire sit 

wire sp3x 

wire sp3sp 

/* | decodes */ 

wire i8 

wire i8s 

wire rdrs 

wire rdn 

wire rdrd 



cifl5] & (-«t | -d[13] | -d[12]) & cif 1 1] & --speciaD; 

- -d[15] & ci[I4] & ^ci[12] ; 
= ci[15:12] — 4-bOOOO | dfl5] & -ci[13] & d[I2] & drill: 
= -ext Jal & 0(15:111 = 51)10110; 

= -exi Jal & ( -^[15] & df]4] & d[12] & ~<d(13] & d[l 11) ); 
« -ci[15]&-<i[14]&(d[13]|^i[12]) | d[15:Ill — 5W001 
~- splor2 &d[12]; 
= -(xspjbr); 

- ext & ximm ; 

= spany & d[12] ; 
= spany | spedal3 ; 

- d[iS:Il] — 5*01111 ; 

- brjal | xn ; 

= -extjal&(ill | sll | d(I5:ll] = 51)01000 ); 



special & d[4:I] = 41)0100' 
jr & d[0); 

special & d[4] & -d[3] &. d[l] & d[01; 

d[8:6] — 31)000; 

special2 & -<i[0] ; 

sp2x&d[I] ; 

speciaD & d[4] ; 

sp3x&d[8:6] = 3oI01 ; 



-ext Jal & -d[15] | sp3sp; 
iS & — il 1 ; 

sp2x<&-ci[l] | spedal&ri[4] 
special & -(d[4) | ci(3]) | jalr ; 
special: & cifOJ | sll ; = 
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wire 


shrs 




wire 


rs5 


m 


wire 


rsz 


XM 


wire 


ms 




wire 


rort 




wire 


snsp3 


ae 


wire 


snl 




wire 


snm 




wire 


snh 




wire 


snll 


SB 


wire 


sn!2 




wire 


xs 




wiye 


xl 





^doial Sl ( brjal | -exi & xn | ci(14:l 1] — 4*b0000 8l d[4:2] = 31)00 & (en ? -<ci[15] | 

ci(5]lx(l0]):d[15])); 
octjallill |jr&^i[0]; 

rsza | sp2x & -a[2J & ~ci[l] | negnot | sp3x & neqO | shxs ; 
-cxtjal & opi I sp3sp ; 
~< cxtjal I ms | jr | -o[15] & -opi ); 
ci[3) <fe special! & -(ci[4] & rseqO); 
snsp3 & — sp3sp; 

ci[I0] & -exijal & snx | snsp3 ; 
snm | ci[10] &-<exiJai&br | x[10] & exti ; 
cxn&half ? cifO] :snh ; 
exn& word?ci[l] : snh ; 
sp2x & ci[4] | sp3x | opx ; 
jalr | sp2x & d[3] 1 sp3x; 

assign xopl] * -dojal & d[ 15] &( ext | -x4z) ; 
assien xof30I = exi &. xsp & x[10] ; 

assiln xof29 - -dojal & d{14] & -< -exi& ci[151 & -ci[12] &~d[ll] ); .„„,u^ 

SpTl - di^|dll2]&(oiKspa n y|ci[151ftci[14]&-dlll])); 
assign xo[26] - dojal | d[ll] ; 

wire[40] re - { rs5 ? d[10:9] : {xs.(d[8] & xs I rseqO)}. d(8:6] }; 
assign xo[25:21J » rs & {5{-rsz}} ; 

wire[4:0] rt - { «. (d[10] ? ffl : -(d[9] | d[5])), d[10:9], d[5] }; 
assign xo(20:16] - rt&{5{rm)} 

| rs&{5<nis}} 

| x(9:5] & {5<extjal}} 

| {rcxO, nxO, 3*b00); 

wire(4:01 id - { I'M. !d[4:2], d[4:2] }; 
assien xo [15:11] - rs&<5<rdrs}} 

| n&{5{rdrt}} 

| rd & {5{rdrd}} 

| x(4:01 & {5{cxt & -ximm | dojal}} 
| { {2{slt | snh)}, snh, snl2, snll }; 

assign xo(10:6] 

= re& {{2{shrs}}. {3{shrsisll}}} 
| d(l5:lll & {5{dojal}} 

| fdm&£d[0)&sll. lT»0,d[10]&i8s.d[91&i8s|d[l]&-exiJal&word} 
| { mm snm. sun, snl, snl } ; 

assign xo[5] = a{5) & (cxtjal | i8) | d[41 & special 

| d[0) & -cxtjal & half | special2 | snl ; 

assign xo[4] =» d[4] & -x4z | d[ll & sp3sp | snl ; 

assign xo(3] - sp3sp ? ci[0] : d[3] &~x2z| sit ; 

assign xo(2] = d[2] & -x2r, 

assign xo[ll = d[ 11 &-( -dojal & word | sll | sp3sp ); 

assign xo[0] - d(0| &-< -dojal & half | sll | sp3sp ) I speoal2 & a[2] ; 

endmodule 

In accordance with the above disclosure, a microprocessor has been described which executes 
instructions from both a compressed instruction set and a non^ompressed instruction set. The microprocessor 
expands the compressed instructions into decompressed instructions or directly decodes' the compressed 
instructions. Advantageously, roudnes coded using the compressed instruction set occupy a smaller amount of 

24 




memory than the corresponding routines coded in non-compressed instructions. Memory fbnneriv occupied by 
such routines may be freed for use by other routines or data operated upon by such routines. 
Numerous variations and modifications win be com e apparent to those skilled in the ait once the above 
disclosure is fully appreciated. It is intended that the following rhjm^ be interpreted to embrace all such 
variations and modifications. 
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CLAIMS : ^ A 

I An appanrus for executing instructions from a variable-length ajmpresseTmstruoioa set composing: 

an instruction decompressor (12A. 12B) coupled to receive compressed instructions (20. 30. 40. 50. 
60. 70. 30. 90) which art members of said variable-length compressed instruction set. 
wherein said instruction decompressor is configured to decompress said compressed 
insOTxeaons into decompressed instructions, and wherein, said instruaion decompressor is 
configured to examine a first opcode field (22) of a particular compressed mstruction, and 
wherein said atssructiou decompressor is configured to determine that said particular 
compressed instruction, is an extended mstrucaou (60. 70. 80. 90) having a first fixed length 
if said first opcode &dd is coded asm 
decompressor is amfi^ 

extended instruction (20, 30, 40. 50) having a second fixed length if said first opcode field is 
coded as a second opcode different than said extend opcode, and wherein said extended 
instruction includes a second opcode filed (66.78). and wherein said instruction decompressor 
is configured to decompress said second opcode field of said extended instruction into an 
decompressed opcode field within a decompressed instruction, and wherein said instruction 
decompressor is configured to decompress said first opcode field of said non-extended 
insttuction into said decompressed opcode field. 

to a »awl «=»=r of ^ cotr^ondmg to said «cood ted ta.pt 

integer factor. 

4. tie apparatus is rcciKd » daim 3 wtarna aid to«S= fa» * 

«„, opcode Sc!d in onte . d«cnnm« ho. <o d«on,r=s aid ««*d instnenoe 

, ^ ««. . <«M * ' * <lS) '° 

decamprtssed Losouctians. 

, „ „ - b, d- . — - - — — — — " ~ 

instruction. 

.^. mdaiml whcrsm a first one of said compressed inactions is ccdable to access 
8. The appancus as recited si Oaun 1. wacran a 



a first subset of registers d^^d for a corresponding aon-compressed ihs^Pbon ser. and wherein a second 
one of said compressed instructions is codable to access said fust subset of registers and is further codable to 
access a second subset of registers. 



9. The apparatus as recited in claim 3 wherein said second one of said compressed instructions is assigned a 
first opcode encoding and a second opcode encoding. 

10. The apparatus as recited in claim 9 wherein said first opcode encoding indicates that said second one of 
said compressed instructions is coded to access one of said first subset of registers. 

1 1. The apparatus as recited in claim 10 wherein said instruction decompressor is configured to decompress 
said second one of said compressed instructions using a first mapping of compressed register codings to 
decompressed register codings, wherein said first mapping maps each compressed register coding to a 
decompressed register coding within said first subset 

12. The apparatus as recited in claim 1 1 wherein said second opcode encoding indicates that said second one 
of said compressed instructions is coded to access one of said second subset of registers. 

13. The apparatus as recited in claim 12 wherein said instruction decompressor is configured to decompress 
said second one of said compressed instructions using a second mapping of compressed register codings to 
decompressed register codings, wherein said second mapping maps each of said compressed register codings to 
a decompressed register coding within said second subset of registers. 

14. The apparatus as recited in claim 9 wherein said second one of said compressed instructions includes a 
first register field (26) and a second register field (32). 

15. The apparatus as recited in claim 14 wherein said instruction decompressor decompresses said first register 
field according to said first subset of registers if said first opcode encoding is used. 

16. The apparatus as recited in claim 15 wherein said instruction decompressor decompresses said first register 
field according to said second subset of registers if said second opcode encoding is used. 

17. The apparatus as recited in claim 16 wherein said second one of said compressed instructions is assigned a 
third opcode encoding, wherein said decompressor decompresses said second register field using said second 
subset of registers if said third opcode encoding is used 

IS. The apparatus as recited in claim 17 wherein said second one of said compressed instructions is assigned a 
fourth opcode encoding, wherein said decompressor decompresses said first register field and said second 
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register field using said second subset of registers if said fourth opcode [encoding is used. 

19. The apparatus as recited in claim 9 wherein said first opcode encoding and said second opcode encoding 
differ in bits included in a function field (28, 34) of said second one of said compressed instructions. 

20. The apparatus as recited in claim I. wherein said instruction decompressor is configured to decompress a 
compressed register field (26, 32) of one of said compressed instructions into a decompressed register field of a 
corresponding decompressed instruction responsive to a first value coded into said compressed register field 
and a second value coded into said first opcode field of said compressed instruction. 

2 1 The apparams as recited in 20 wherein said instruction decompressor is configured to select one of 
multiple mappings from compressed register field encodings to decompressed register field encodings in 
response to said second value. 

22. The apparams as recited in daim 21 wherein said instruction decompressor is configured to select one of 
said decompressed register encodings from said one of said multiple mappings in response to said first value. 

23. The apparatus as recited in claim 22 wherein a particular instruction is assigned a first opcode encoding 
and a second opcode encoding, and wherein said instruction decompressor, upon receipt of said particular 
Lnsmiction having said first opcode encoding selects said one of said-multiple mappings for decompressing 
said compressed register field. 

-> x The apparatus as recited in claim 23 wherein said instruction decompressor, upon receipt of said particular 
unction having said second opcode encoding, is configured to select another one of said multiple mappings 
for decompressing said compressed register field 

25. The apparatus as recited in claim I wherein said instruction decompressor is configured to decompress a 
compressed register field (26. 32) of a compressed instruction into a decompressed register field of a 
decompressed instruction, wherein said instruction decompressor forms a first portion of said decompressed 
register field bv copying at least a portion of said compressed register field thereto, and wherein said 
instruction decompressor includes a logic block (212) which is configured to operate upon said compressed 
register field to produce a remaining portion of said decompressed register field. 

26. The apparatus as reched in claim 25 wherein said portion of said compressed register field comprises an 

cndrsty of said compressed register fiel± 

27 The apparatus as recited in claim 25 wherein said logic block is configured to select a register mapping 
from compressed register 5eld encodings to depressed register field encodings m response to a signal 
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received from said Lnstmcuon decompressor 



28. The apparatus as recited In claim 27 wherein said instruction decompressor is configured to assert said 
signal upon detection of a first opcode encoding within said first opcode field, said first opcode encoding 
assigned to a particular instruction, and wherein said instruction decompressor is configured to deassen said 
signal upon detection of a second opcode encoding within said first opcode field, said second opcode encoding 
assigned to said particular instruction. 

29. A method for expanding compressed instructions into decompressed instructions, comprising: 

detennining that a compressed instruction (20, 30, 40, 50, 60, 70, 30, 90) is an extended instruction 
(60, 70, 80) having a first fixed length if an opcode field (22) of said compressed instruction 
is an extend opcode, wherein a number of bytes included in said compressed instruction is 
defined by said first fixed length if said compressed instruction is said extended instruction, 
and wherein said extended instruction includes a second opcode field (66, 78); 

determining that said compressed instruction is a non-extended instruction (20, 30, 40, 50) having a 
second fixed length if said opcode field of said compressed instruction is a second opcode 
different than said extend opcode, wherein said number of bytes is defined by said second 
fixed length if said compressed instruction is said non-extended instruction; and 

decompressing said compressed instruction into a decompressed instruction, said decompressed 
instruction including a decompressed opcode field, wherein said decompressing includes 
decompressing said second opcode field into said decompressed opcode field responsive to 
said determining that said compressed instruction is said extended instruction, and wherein 
said decompressing includes decompressing said opcode field into said decompressed 
opcode field responsive to said determining that said compressed instruction is said non- 
extended instruction. 

30. The method as recited in claim 29 further comprising fetching compressed instructions in quantities 
defined by said second fixed length. 

3 1. The method as recited in claim 30 further comprising forming said extended instruction, when detected, 
from a first of said quantities in which said extend opcode is delected and a second of said quantities which is 
immediately subsequent to said first of said quantities. 

32. The method as recited in claim 29 further comprises identifying a particular extended instruction using 
said second opcode field. 
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33. The method as recited in claim 32 wherein said decompressing comprises identifying an extended 
inaction format (60. 70. SO) corresponding to said particular extended instruction. 

34 The method as recited m claim 33 wherein said decompressing further comprises interpreting bits within 
said particular extended mstruction according to a plurality of axstrucuon fields within satd extended 
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instruction format 



3 5 The method as recited ax claim 29 wherein said decompressing comprises identtfyiag a non-«tended 
Action format (20. 30, 40. 50) corresponding to said turn-extended mstrucdon from an encoding of satd 



opcode field. 



15 mstruction format 

37. The method as recited in claim 29, further comprising: 

decompressingapardauar compressed t^^.**^™**-™*^ 

register mapping from compressed register indicators to decompressed reg*ter uxdxcators for 
decompressing said first register field if saxd particular compressed ^cunn * encoded 

using a first opcode; and 

d„,»r.r=sir,g aid first Scld if saidpTdcuI* a>n,r= Se d teauaioa is =>cod=d 

using a second opcode. 

38. The . — * d- * — ■ - fcnb " " 

30 register field (26, 32). 

39 me method as recited m daim 3 S fu^er comprismg decompress*^ said second re^st. Seid using said 
second renter mapping if said particular composed mstrucdon is encoded using a tod opcode. 



, , . . m daim 39 fann^ comprising decompressing said second register field and said 
40. The method as recited m claim ^ runner * ^ encoded using 

• rmv ictef maDDtnE if said particular compressed instruction is eacoow 
first register field using said second register mappmg n saw P 



a fourth opcode. 
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4 1. The method as recited in 3Em 37 further comprising decompressing a seRnd pvtictlar compressed 
instruction using said first mapping. 



42. The method as recited in claim 29, further comprising: 

5 

directly copying at least a portion of a compressed register field (26, 32) within one of said 

compressed instructions into a portion of a decompressed register field of a co rr e sp onding 
decompressed instruction; and 

10 ' logically operating upon said compressed register field to produce a remaining portion of said 

decompressed register fidd 

43. The method as recited in daim 42 wherein said portion of said compressed register field comprises an 
entirety of said compressed register field. 

15 

44. The method as recited in daim 43 wherein said portion of said decompressed register field receiving said 
entirety of said compressed register fidd comprises a plurality of least significant bits of said decompressed 
register Odd 

20 45, The method as recited in claim 42 further comprising logically operating upon an opcode fidd (22) of said 
compressed instruction to produce said remaining portion of said decompressed register field 

46. The method as recited in daim 45 wherein said logically operating upon said opcode field comprises 
sdecting a first register mapping in response to a first opcode encoding in said opcode field 

25 

47. The method as recited in rixirry 46 wherein sa vi logically operating upon said opcode field further 
comprises sdecting a second register mapping in response to a second opcode encoding in said opcode fidd 

48. The method as recited in claim 47 wherein said first opcode encoding and said second opcode encoding 
30 are assigned to a particular instruction. 
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49. An apparatus substantially as hereinbefore described as an 
embodiment and as shown in the corresponding accompanying drawings. 

50. A method substantially as hereinbefore described as an embodiment 
with reference to the corresponding accompanying drawings. 
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